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Question 5 


Intent of Question 





The primary goals of this question were to assess students’ ability to (1) calculate appropriate probabilities, 
including conditional probabilities, from a two-way table; (2) determine from a two-way 

table whether two events are independent; (3) identify an appropriate test procedure for assessing 
independence between two categorical variables. 


Solution 
Part (a): 


Using the addition rule, the probability that the randomly selected adult is a college graduate or 
obtains news primarily from the internet is: 


P(college graduate or internet) = P(college graduate) + P(internet) — P(college graduate and internet) 
_ 693 687 245 _ 1135 _ 
~ 2500 : 2500 2500 2500 — ea 





Part (b): 


Reading values from the table, the conditional probability that the selected adult receives news 


primarily from the internet given that he or she is a college graduate is: aaa = 0.354. 


Part (c): 


These events are not independent. One way to establish this is to note that the unconditional 


probability equals P(obtains news primarily from the internet) = O87 = 0.275 , but the conditional 


2500 
probability equals P(obtains news primarily from the internet /is a college graduate) = 0.354. Because 


these two probabilities are not equal, the events “is a college graduate” and “obtains news primarily 
from the internet” are not independent. 


Part (d): 


Chi-square test of association (or independence), with 
degrees of freedom = (# of rows — 1) x (# of columns — 1) = (5-1) x (3-1) =8. 


Scoring 
Parts (a), (b), (c) and (d) are each scored as essentially correct (E), partially correct (P) or incorrect (I). 
Part (a) is scored as follows: 


Essentially correct (E) if the probability is computed correctly and appropriate work is shown OR the 
probability calculation is set up correctly but a minor computational error is made. 
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Question 5 (continued) 


Partially correct (P) if the probabilities of the two events are added without subtracting the probability 


1380 _ 
5509 = 0-552. 





of their intersection, resulting in 


OR 
Independence is assumed in computing the probability of the intersection. 


Incorrect (I) if the response does not meet the criteria for an E or P, or includes the correct decimal 
answer with no accompanying work or justification. 


Note: An answer of 1135 in fraction form is sufficient to be scored as essentially correct (E). 


2500 





Part (b) is scored as follows: 


Essentially correct (E) if the conditional probability is correctly computed and appropriate work is 
included OR if the calculation is set up correctly but a minor computational error is made. 


Partially correct (P) if the reverse conditional probability (being a college graduate given that he or she 
245 


primarily obtains news from the internet) is computed, resulting in 687 > 0.357. 


Incorrect (I) if the probability of the intersection of the two events is computed, resulting in 


245 
3500 > 0.098. 
OR 
The unconditional probability of obtaining news primarily from the internet is computed, resulting in 
687 
=~ = 0.275. 
2500 pate 
OR 


The response otherwise fails to meet the requirements for an E or P. 


Notes 

e An answer of 2S in fraction form is sufficient to be scored as essentially correct (E). 
e Acorrect decimal answer with no work or justification is scored as incorrect (I). 

Part (c) is scored as follows: 


Essentially correct (E) if the response states that the events are not independent and gives a correct 
numerical justification based on the table. 


Partially correct (P) if the response states that the events are not independent and gives a correct 
statistical justification, but numerical support is not included (for example, says that P(C/I) + P(C) but 
never reports either probability) OR the response includes correct and relevant calculations related to 
independence of these events but reaches an incorrect conclusion that the events are independent. 


Incorrect (I) if the response states that the events are not independent but the given justification is not 
based on a correct probability argument OR the response does not reveal an understanding of how to 
assess whether two events are independent by comparing appropriate probabilities. 
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Question 5 (continued) 


Part (d) is scored as follows: 


Essentially correct (E) if the chi-square test of association (or independence) is correctly identified and 
the correct degrees of freedom are given. 
Note: It is not necessary to show work in calculating the degrees of freedom. 


Partially correct (P) if the response includes the correct name (chi-square test of association or 
independence) but not the correct degrees of freedom. 


Incorrect (I) if the response includes neither identification of the chi-square test of association or 
independence nor correct degrees of freedom. 


Notes 


If the response includes only “chi-square test” without specifying “of association (or 
independence),” this part is scored as essentially correct (E) provided that the degrees of freedom 
are computed correctly but as incorrect (I) if the degrees of freedom are incorrect. 

If the response identifies the test as “chi-square test of goodness-of-fit” or “chi-square test of 
homogeneity of proportions,” the response is scored as incorrect (J). 

If the response does not name a correct test and only gives correct degrees of freedom, the 
response is scored as incorrect (I). 


Each essentially correct (E) part counts as 1 point. Each partially correct (P) part counts as % point. 


4 


3 


2 


Complete Response 
Substantial Response 
Developing Response 


Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to determine whether 
to score up or down, depending on the strength of the response and communication. Also use the 
following guidelines: 


If part (a) was scored as partially correct (P), always score down. 

A holistic score of 1 may be given to a response with all four parts scored as incorrect (I), if parts (a) 
and (b) both provide correct decimal answers but received no credit because supporting work was 
not included. 
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5. An advertising agency in a large city is conducting a survey of adults to investigate whether there is an 
association between highest level of educational achievement and primary source for news. The company 
takes a random sample of 2,500 adults in the city. The results are shown in the table below. 


HIGHEST LEVEL OF EDUCATIONAL ACHIEVEMENT 


Seuree Not High High School Graduate — 
pec School But Not College College Graduate Total 
So Graduate - 
Local television Teas oo ee ee 
a 
oe AT ep 


(a) If an adult is to be selected at random from this sample, what is the probability that the selected adult is a 
college graduate or obtains news primarily from the internet? 











“The propability Ahek the jected adult & « colle ae aroducte. or obtains neks 


primarily frow tte Thterret is G3 431-245 13S . y. 
S00 ~ 25 -~ o.4s * 


(b) If an adult who is a college graduate is to be selected at random from this sample, what is the probability that 
the selected adult obtains news primarily from the internet? 


let—A teste the 


The pe bob’ tty thet the glected duit ocbtoms news prrar'ly: from te Ticternct 


In the Conditven that on adlutt the is G, College gradmete i te ho. selectect is 
ae 





Ss 
arco 23" 
a3 = a * 0.354 


200 
(c) When selecting an adult at random from the sample of 2,500 adults, are the events “is a college graduate” 
and “obtains news primarily from the internet” independent? Justify your answer. 


‘The prichty SG He gant “hs 4 college studegt™ ts Baily chile that 


iS ~ er 
Ror ** obtacns news prmerila frown ethe internet rs se . 
et: 3 _6en . 
cna, SEF Baie They ove nat Adqenctant, 


(d) The company wants to conduct a statistical test to investigate whether there is an association between 
educational achievement and primary source for news for adults in the city. What is the name of the 


statistical test that should be used? 
The Chi- square fest for independerca shatd be wed. 
What are the appropriate degrees of freedom for this test? §. 
The oppropevorte daevee> JE Peed Ge ths tet iS GHAR E) zB, 


(3 fs the mwnber oF coltunns, pale S cs the nucber shes.) 


GO ON TO THE NEXT PAGE. 
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5. An advertising agency in a large city is conducting a survey of adults to investigate whether there is an 


association between highest level of educational achievement and primary source for news. The company 
takes a random sample of 2,500 adults in the city. The results are shown in the table below. 


HIGHEST LEVEL OF EDUCATIONAL ACHIEVEMENT 


Primary S Not High High School Graduate 
for Nowe School But Not College College Graduate 
Graduate Graduate 













[Newspapers [49 20588 
|Localtelevision [| 90 | TO S885 
Cable television 





[Intemet | 4 | SBT 
[Total | 870 1,437 


(a) If an adult is to be selected at random from this sample, what is the probability that the selected adult is a 
college graduate or obtains news primarily from the internet? 


east (663-245) acy 






Oo ; 
nae sty of peal adult ue 3 college geod vate c 
pa areiees NewS Re bmee | 4 ftom Miecae * is 4, aa +/., 


(b) If an adult who is a college graduate is to be selected at random from this sample, what is the probability that 


the selected adult obtains news primarily from the internet? 
- Probab: Y Ly sh argadcriak 


G 
2 4 SE SS SS ef colle. graclate, phe 


6% 3 | Sets prince hy Nev sl Sourl-e 
trom Orheened 12 SF.3acy¢e 


(c) When selecting an adult at random from the sample of 2,500 adults, are the events “is a college graduate” 
and “obtains news primarily from the internet” a Di a your Nel saiaaas 
No + he, Ae ne + Inch € pe 


“nis ty deca (Ct¥(EeD 4 Cavseso9 


(d) The company wants to conduct a statistical test to investigate whether there is an association between 
educational achievement and primary source for news for adults in the city. What is the name of the 
statistical test that should be used? 


You pould vse a no” test. 


What are the appropriate degrees of freedom for this test? 


Are leo’ rate degrees of frieclom eel’ be (00 
aso is fu hig bess Cmeent tA ible Gi. 
GO ON TO THE NEXT PAGE. 
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5. An advertising agency in a large city is conducting a survey of adults to investigate whether there is an 
association between highest level of educational achievement and primary source for news. The company 
takes a random sample of 2,500 adults in the city. The results are shown in the table below. 


HIGHEST LEVEL OF EDUCATIONAL ACHIEVEMENT |: 


Prmiaty Sourte Not High High School Graduate 
me eis School . But Not College College Graduate Total 
Graduate Graduate 
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[Internet ss] 4 TS 87" 
1,437 
(a) If an adult is to be selected at random from this sample, what is the probability that the selected adult is a 
college graduate or obtains news primarily from the intemmet? ean 
—_—_——__ 4 . a . 
= (A) + PLB ) =P CAAR ). 5 There BA 
? (AVR? ¥ CANO Atpert WHE 
= 6 a3 A Gdt } 4 y 245 selected rd ht 1S 
"Zg00 2500 “ySee a, fone 
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(b) If an adult who is a college graduate is to be selected at random from this sample, what is the probability that 
the selected adult obtains news primarily from the internet? 


Alans new for 2Z4S- 35-235 tf. 


wna nae WOES = ae 
a se om B a 3-35 7- chance wnat ar adurk ee ‘ 
aeese Graduate B foe seleured at raurdorn fromm SAE OUT 
ihe celeclid adult obtairt Hass pyirmav! S fron Ahe 
(c) When selecting an adult at random from the sample of 2,500 adults, are the events “is a college graduate” 
and “obtains news primarily from the internet” independent? Justify your answer. 
No hey ort wok Iwdepcrdant because coilese dvaduates 
Tas one. \wiei he ae access *o internet Shan adults of cthev 
aroun o + educottsrl levels - 


(d) The company wants to conduct a statistical test to investigate whether there is an association between 
educational achievement and primary source for news for adults in the city. What is the name of the 
statistical test that should be used? 


What are the appropriate degrees of freedom for this test? 


QUAQA deqees ot Lewd Would br 


GO ON TO THE NEXT PAGE. 
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Question 5 


Sample: 5A 
Score: 4 


The probability calculation in part (a) is correct, and supporting work is shown. In part (b) the correct 
conditional probability is calculated, and supporting work is given. Parts (a) and (b) were scored as essentially 
correct. In part (c) the student incorrectly labels the event “is a college graduate” as “is a college student.” 
This was considered a minor error. The response shows that the probability of the intersection of the two 
events “is a college graduate” and “obtains news primarily from the internet” is not equal to the product of 
the two individual event probabilities and correctly concludes that the two events are not independent. 

Part (c) was scored as essentially correct. In part (d) the response correctly identifies the appropriate test as 
the “[c]hi-square test for independence” and includes a correct calculation of degrees of freedom. Part (d) was 
also scored as essentially correct. The entire answer, based on all four parts, was judged a complete 
response and earned a score of 4. 


Sample: 5B 
Score: 3 


The probability calculations in parts (a) and (b) are correct, and supporting work is given. These two parts 
were scored as essentially correct. In part (c) the response correctly concludes that the two events “is a 
college graduate” and “obtains news primarily from the internet” are not independent and supports this 
conclusion with an argument that is equivalent to showing that the product of the individual event 
probabilities is not equal to the probability of the intersection of the two events. Part (c) was scored as 
essentially correct. In part (d) the chi-square test is identified, but the response does not specify that the test 
is for association or independence. This omission, accompanied by incorrect degrees of freedom, resulted in 
a score of incorrect for part (d). The entire answer, based on all four parts, was judged a substantial 
response and earned a score of 3. 


Sample: 5C 
Score: 2 


The probability calculation in part (a) is correct, and supporting work is shown. In part (b) the correct 
conditional probability is calculated, and supporting work is given. Parts (a) and (b) were scored as essentially 
correct. Although the response indicates that the two events “is a college graduate” and “obtains news 
primarily from the internet” are not independent, no statistical justification is provided, so part (c) was scored 
as incorrect. In part (d) no test procedure is identified, and the degrees of freedom given are incorrect. Part (d) 
was scored as incorrect. The entire answer, based on all four parts, was judged a developing response and 
earned a score of 2. 
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